Ultravox v0.6 Gemma 3 27B
MIT
Ultravox is a multimodal speech large language model that accepts both speech and text as input, which makes it well suited to voice interaction applications.
Text-to-Audio
Transformers
Supports Multiple Languages